
This archive contains replication materials for "Underproduction Analysis of Open Source Software". This archive contains the extension materials to be used in conduction with two other dataverses: 

Champion, Kaylea; Hill, Benjamin Mako, 2021, "Replication data and online supplement for: Underproduction: An Approach for Measuring Risk in Open Source Software", https://doi.org/10.7910/DVN/PUCD2P, Harvard Dataverse, V2, UNF:6:A8MV1fxlZnJtlKI3DnGaRg== [fileUNF]

and

Kaylea Champion, 2024, "Replication Data for: Sources of Underproduction in Open Source Software", https://doi.org/10.7910/DVN/N2HIRS, Harvard Dataverse, V1 

You will need the contents of all three archives (including this one) to fully replicate the materials in "Underproduction Analysis of Open Source Software". 

Step 1. Follow the instructions for Champion and Hill 2021. This will produce the materials viewable as Experiment 1 and Experiment 2 part 1 and 3.

Step 2. Follow the instructions for Champion 2024. This will produce the materials viewable as Experiment 3.

Step 3. The following guide will allow you replicate materials from Experiment 2 part 2 and the Extended Analysis produced as part of the Discussion.

a) Obtain Data:
	i) CoLIS data via scraping using scrapeCOLIS.py. CoLIS data was cleaned as described in colisNotes.txt. My results are in colisFoundBugs.tsv.
	ii) Obtain Debian bullseye data via calls to the UDD described in UDDfetch.txt. My results are in bullseye_packages.txt.
	iii) Obtain Popcon data via API calls using scrapePopcon.py. Popcon data was cleaned as described in popconNotes.txt. My results are in bullseye_usage_20220215.tsv.

b) Run analysis in R, first by editing the globals to match your environment and paths; lib-00-utils.R will be loaded as part of this process.:
	i) prepData.R will build the datasets.
	ii) standalone.R will conduct the analysis.
	iii) visuals.R will build the figures.



Please let me know if you have any questions -- given the number of analyses in this paper, I may be able to save you a lot of time by pointing you to just the part you're looking for. Kaylea Champion (kaylea@uw.edu ; khascall@gmail.com)


